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Abstract. We consider stochastic differential equations of the form dYt = V (Yt) dX t + Vo (Yt) dt 
driven by a multi-dimensional Gaussian process. Under the assumption that the vector fields Vb 
and V = (Vi, . • • , Vj) satisfy Hormandcr's bracket condition, we demonstrate that Yt admits a 
smooth density for any t £ (0, T], provided the driving noise satisfies certain non-degeneracy 
assumptions. Our analysis relies on an interplay of rough path theory, Malliavin calculus, and 
the theory of Gaussian processes. Our result applies to a broad range of examples including 
fractional Brownian motion with Hurst parameter H > 1/4, the Ornstein-Uhlenbcck process and 
the Brownian bridge returning after time T. 



1. Introduction 

Over the past decade our understanding of stochastic differential equations (SDEs) driven by 
Gaussian processes has evolved considerably. As a natural counterpart to this development, there 
is now much interest in investigating the probabilistic properties of solutions to these equations. 
Consider an SDE of the form 

dY t = V(Y t )dX t + Vq (Y t ) dt, Y(0) = y Q eR e , (1.1) 

driven by an M d -valucd continuous Gaussian process X along C^°-vector fields Vq and V = (V\ , . . . , Vd) 
on R e . Once the existence and uniqueness of Y has been settled, it is natural to ask about the 
existence of a smooth density of Y t for t > 0. In the context of diffusion processes, the theory 
is classical and goes back to Hormander [55] for an analytical approach, and Malliavin [57] for a 
probabilistic one. 

For the case where X is fractional Brownian motion, this question was first addressed by Nualart 
and Hu |23| , where the authors show the existence and smoothness of the density when the vector 
fields are elliptic, and the driving Gaussian noise is fractional Brownian motion (fBM) for H > 
1/2. Further progress was achieved in [1] where, again for the regime H > 1/2, the density was 
shown to be smooth under Hormander's celebrated bracket condition. Rougher noises are not 
directly amenable to the analysis put forward in these two papers. Additional ingredients have 
since gradually become available with the development of a broader theory of (Gaussian) rough 
paths (see [25], jS], [H]). The papers [7J and [6] used this technology to establish the existence of a 
density under fairly general assumptions on the Gaussian driving noises. These papers however fall 
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short of proving the smoothness of the density, because the proof demands far more quantitative 
estimates than were available at the time. 

More recently, decisive progress was made on two aspects which obstructed the extension of 
this earlier work. First, the paper [5] established sharp tail estimates on the Jacobian of the flow 
Ju-o(yo) driven by a wide class of (rough) Gaussian processes. The tail turns out to decay quickly 
enough to allow to conclude the fmiteness of all moments for j£_ (yo). Second, [2T] obtained 
a general, deterministic version of the key Norris lemma (see also [24] for some recent work in 
the context of fractional Brownian motion). The lemma of Norris first appeared in [53] and has 
been interpreted as a quantitative version of the Doob-Meyer decomposition. Roughly speaking, 
it ensures that there cannot be too many cancellations between martingale and bounded variation 
parts of the decomposition. The work |21j however shows that the same phenomenon arises in a 
purely deterministic setting, provided that the one-dimensional projections of the driving process 
are sufficiently and uniformly rough. This intuition is made precise through the notion of the 
"modulus of Holder roughness" . Together with an analysis of the higher order Malliavin derivatives 
of the flow of (jl.lj) . also carried out in [21], these two results yield a Hormander-type theorem for 
fractional Brownian motion if H > 1/3. 

In this paper we aim to realise the broader potential of these developments by generalising the 
analysis to a wide class of Gaussian processes. This class includes fractional Brownian motion with 
Hurst parameter H £ (i, i], the Ornstein-Uhlenbeck process, and the Brownian bridge. Instead 
of focusing on particular examples of processes, our approach aims to develop a general set of 
conditions on X under which Malliavin-Hormander theory still works. 

The probabilistic proof of Hormander's theorem is intricate, and hard to summarise in a few 
lines, see [18] for a relatively short exposition. However, let us highlight some basic features of the 
method in order to see where our main contributions lie: 

(i) At the centre of the proof of Hormander's theorem is a quantitative estimate on the non- 
degeneracy of the Malliavin covariance matrix Ct (w). Our effort in this direction consists 
in a direct and instructive approach, which reveals an additional structure of the problem. 
In particular, the conditional variance of the process plays an important role, which does not 
appear to have been noticed so far. More specifically, following [7] we study the Malliavin 
covariance matrix as a 2D Young integral against the covariance function R(s,t). This 
provides the convenient representation: 



for some 7-Holder continuous / (v;u>), which avoids any detours via the fractional calculus 
that are specific to fBM. Compared to the setting of [5] we have to impose some additional 
assumptions on R(s,t), but our more quantitative approach allows us in return to relax 
the zero-one law condition required in this paper, 
(ii) An essential step in the proof is achieved when one obtains some lower bounds on v T CtV 
in terms of |/|oo;[o,tl- Towards this aim we prove a novel interpolation inequality, which lies 
at the heart of this paper. It is explicit and also sharp in the sense that it collapses to a 
well-known inequality for the space L 2 ([0, T]) in the case of Brownian motion. Furthermore, 
this result should be important in other applications in the area, for example in establishing 
bounds on the density function (see [2] for a first step in this direction) or studying small- 
time asymptotics. 
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(iii) Hormandcr's theorem also relies on an accurate analysis and control of the higher order 
Malliavin derivatives of the flow J^_ (yo). This turns out the be notationally cumbersome, 
but structurally quite similar to the technology already developed for fBm. For this step 
we therefore rely as much as possible on the analysis performed in [21]. The integrability 
results in [5] then play the first of two important roles in showing that the flow belongs to 
the Shigekawa-Sobolcv space B 00 ^ 6 ). 

(iv) Finally, an induction argument that allows to transfer the bounds from the interpolation 
inequality to the higher order Lie brackets of the vector fields has to be set up. This 
induction requires another integrability estimate for J^ Q (y ), plus a Norris type lemma 
allowing to bound a generic integrand A in terms of the resulting noisy integral J A dX in 
the rough path context. This is the content of our second main contribution, which can be 
seen as a generalisation of the Norris Lemma from [21] to a much wider range of regularities 
and Gaussian structures for the driving process X. Namely, we extend the result of [21] 
from p-rough paths with p < 3 to general p under the same "modulus of Holder roughness" 
assumption. It is interesting to note that the argument still only requires information about 
the roughness of the path itself and not its lift. 

Let us further comment on the Gaussian assumptions allowing the derivation of the interpolation 
inequality briefly described in Step (ii) above. First, we need a standing assumption that regards the 
regularity of R(s, t) (expressed in terms of its so called 2D p- variation, see [H]) and complementary 
Young regularity of X and its Cameron-Martin space. This is a standard assumption in the theory 
of Gaussian rough paths. The first part of the condition guarantees the existence of a natural lift of 
the process to a rough path. The complementary Young regularity in turn is necessary to perform 
Malliavin calculus, and allows us to obtain the integrability estimates for J^_ (yo) in [5]. 

In order to understand the assumptions on which our central interpolation inequality hinges, let 
us mention that it emerges from the need to prove lower bounds of the type: 

/ fsftdR{s,t)>C\f\;. [0!T] \f\^ 0iT] , (1.2) 

J[0,T]X[0,T] 

for some exponents 7 and a, and all 7-Holder continuous functions /. After viewing the integral 
in (|1.2[) along a sequence of discrete-time approximations to the integral, relation (|1.2p relies on 
solving a sequence of finite dimensional partially constrained quadratic programming (QP) prob- 
lems. These (QP) problem involve some matrices Q whose generic element can be written as 
Qv = E[Xl. t . +1 X}. {. + J, where X}. t . +i designates the increment X£ — X£. of the first compo- 
nent of X . Interestingly enough, some positivity properties of Schur complements computed within 
the matrix Q play a prominent role in the resolution of the aforementioned (QP) problems. In 
order to guarantee these positivity properties, we shall make two non-degeneracy type assumptions 
on the conditional variance and covariance structure of our underlying process X 1 (see Conditions 
[2] and [3] below) . This is obviously quite natural, since Schur complements are classically related to 
conditional variances in elementary Gaussian analysis. We also believe that our conditions essen- 
tially characterise the class of processes for which we can quantify the non-degeneracy of Ct(w) in 
terms of the conditional variance of the process X. 

The outline of the article is as follows. In Section [5] we give a short overview of the elements 
of the theory of rough paths required for our analysis. Section [3] then states our main result. In 
Section [4] we demonstrate how to verify the non-degeneracy assumptions required on the driving 
process in a number of concrete examples. The remainder of the article is devoted to the proofs. 
First, in Section [5] we state and prove our general version of Norris 's lemma and we apply it to 
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the class of Gaussian processes we have in mind. In Section |5J we then provide the proof of an 
interpolation inequality of the type (|1.2[) . In Section [7] we obtain bounds on the derivatives of the 
solution with respect to its initial condition, as well as on its Malliavin derivative. Finally, we 
combine all of these ingredients in Section |8] in order to prove our main theorem. 



2. Rough paths and Gaussian processes 

In this section we introduce some basic notation concerning rough paths, following the exposition 
in [5]. In particular, we recall the conditions needed to ensure that a given Gaussian process has a 
natural rough path lift. 

For JVeN, recall that the truncated algebra T N (R d ) is defined by T N (R d ) = 0^ =o (K d )®", with 
the convention (R^)® = M. T N (M. d ) is equipped with a straightforward vector space structure, 
plus an operation ® defined by 

N 
fe=0 

where 7r„ designates the projection on the nth tensor level. Then (T N (M. d ), +, <g>) is an associative 
algebra with unit element 1 £ (M^)® . 

At its most fundamental, we will study continuous M. d - valued paths parameterised by time on a 
compact interval [0, T] ; we denote the set of such functions by C([0, T], R d ). We write x Syt ■= x t — x s 
as a shorthand for the increments of a path. Using this notation we have 

. Vp 

SUP \ Xt \, IMI p _var;[0,T] — ( SU P ^ |x t t 

*e[o,T] \fl[0,T]=( tj ) j:t . eD[0>T] 

for p > 1 , the uniform norm and the p- variation semi- norm respectively. We denote by C p_var ( [0, T] , R d ) 
the linear subspace of C([0,T],R d ) consisting of the continuous paths that have finite p-variation. 
Of interest will also be the set of 7-Holdcr continuous function, denoted by C 7 ([0, T], R d ), which 
consists of functions satisfying 

II II ^ 

Fll 7 ;[0,T] : = SU P TT ~yy < °°- 

0<s<t<T \t — S\ 

For s < t and n > 2, consider the simplex A™ t = {(ui, . . . , u n ) £ [s, t] n ; «!<•••< u n }, while 
the simplices over [0, 1] will be denoted by A™. A continuous map x : A 2 — > T N (M. d ) is called a 
multiplicative functional if for s < u < t one has x s j = x Si „ g§ x Mjt . An important example arises 
from considering paths x with finite variation: for < s < t we set 



l<i\ , . . . ,i n <d 



dx h ■ ■ ■ dx in I e tl ® ■ ■ ■ <E> e ln , (2.1) 



where {ei, . . . , e^} denotes the canonical basis of R d , and then define the signature of x as 



S N (x) : A 2 -+ T N (R d ), (s, t) h-> S N {x) S;t := 1 + ) 



N 
n=l 



Spf(x) will be our typical example of multiplicative functional. Let us also add the following two 
remarks: 
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(i) A rough path (see Dehnition 12.11 below), as well as the signature of any smooth function, 
takes values in the strict subset G N (R d ) C T N (R d ) given by the "group-like elements" 

G N (R d ) = exp®(L N (R d )), 

where L N (R d ) is the linear span of all elements that can be written as a commutator of the 
type a (£> b — b ® a for two elements in T N (R d ). 

(ii) It is sometimes convenient to think of the indices w = . . . , i n ) in (|2.1j) as words based on 
the alphabet {1, . . . ,d}. We shall then write x. w for the iterated integral J A „ dx n ■ ■ ■ dx ln . 

More generally, if TV > 1 we can consider the set of such group- valued paths 

x t = (l,x t \...,xf)eG w (R d ). 

Note that the group structure provides a natural notion of increment, namely x s t := x.7 1 <3x t , and 
we can describe the set of "norms" on G N (R d ) which are homogeneous with respect to the natural 
scaling operation on the tensor algebra (see [12] for definitions and details). One such example is 
the Carnot-Carathcodory (CC) norm (see [12]), which we denote by || ■ \\ cc ; the precise one used is 
mostly irrelevant in finite dimensions because they are all equivalent. The subset of these so-called 
homogeneous norms which are symmetric and sub- additive (again, see |12| ) gives rise to genuine 
metrics on G N (K d ) , for example dec m the case of the CC norm. In turn these metrics give rise to 
a notion of homogenous p- variation metrics d p . var on the G N (K rf )-valued paths. Fixing attention to 
the CC norm, we will use the following homogenous p- variation and 7-Holder variation semi-norms: 

ll X llp-var:M = . ma * .( SU P W^^ + Mcc) ' 

II II \\ x u,v WcC 

Metrics on path spaces which are not homogenous will also feature. The most important will be 
the following 

N \„k I 

M*, TtM ■■= E su p 2 r v hk ■ ( 2 - 3 ) 

which will be written simply as 7Vx, 7 when [s,t] is clear from the context. 

Definition 2.1. The space of weakly geometric p-rough paths (denoted WGil p (M. d )) is the set of 
paths x : A 2 -> G LpJ (R rf ) such that fO|) is finite. 

We will also work with the space of geometric p-rough paths, which we denote by GQ p (R d ), 
defined as the d p _ V ar-closurc of 

{S LpJ (x):xeC 1 - v ^ ([0,T},R d )}. 

Analogously, if 7 > and N = [1/7] we define C°' 7 ([0, T];G N (R d )) to be the linear subspace of 
GQ N (R d ) consisting of paths x : A 2 ->• G N (R d ) such that 

lim ||x- S N (x n )\\ r ,[o,T] = 

for some sequence (x n )™ =1 C C°°([0, T];R d ). 

In the following we will consider RDEs driven by paths x in WGil p (R d ), along a collection of 
vector fields V = (Vi, . . . , Vd) on R e , as well as a deterministic drift along Vq. From the point of 
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view of existence and uniqueness results, the appropriate way to measure the regularity of the V^'s 
turns out to be the notion of Lipschitz-7 (short: Lip-7) in the sense of Stein (see [12] and [26]). 
This notion provides a norm on the space of such vector fields (the Lip-7 norm), which we denote 
For the collection V of vector fields we will often make use of the shorthand 



I Lip-7 ' 



\v\ 



Lip-7 



max \Vi 

i=l,...,d 



Lip-7 ' 



and refer to the quantity |V| Li as the Lip-7 norm of V. 

A theory of such Gaussian rough paths has been developed by a succession of authors (see [8] 
Q3J [7] [TT]) and we will mostly work within their framework. To be more precise, we will assume 
that X t = (X{ , . . . , Xf} is a continuous, centred (i.e. mean zero) Gaussian process with i.i.d. 
components on a complete probability space (ft,J-,P). Let W = C([0, T], M. d ) and suppose that 
(W,H, p) is the abstract Wiener space associated with X. The function R : [0, T] x [0, T] — > R will 
denote the covariance function of any component of X , i.e.: 

R(s,t)=E[xlxl]. 

Following [14] , we recall some basic assumptions on the covariance function of a Gaussian process 
which arc sufficient to guarantee the existence of a natural lift of a Gaussian rough process to a 
rough path. We recall the notion of rectangular increments of R from |15| , these arc defined by 



R 



s, t 
u, v 



:=E[(XI-Xl) {Xl-Xl)] 



The existence of a lift for X is ensured by insisting on a sufficient rate of decay for the correlation of 
the increments. This is captured, in a very general way, by the following two-dimensional p- variation 
constraint on the covariance function. 



Definition 2.2. Given 1 < p < 2, we say that R has finite (two-dimensional) p-variation if 



V P {R- [0,T] x [0,T] 



E 



sup 

D=(t i )eD([o,r J/ 

£)' = (t:)eX>([0,T]) 



R 



t-ij H+l 

¥ ¥ 



< 00. 



(2.4) 



If a process has a covariance function with finite p-variation for p £ [1,2) in the sense of Dcfi- 
nition |2"T21 [T4l Thm 35] asserts that (X t ) te ^ T j lifts to a geometric p-rough path provided p > 2p. 
Moreover, there is a unique natural lift which is the limit, in the dj,_ var -induced topology, of the 
canonical lift of piecewise linear approximations to X. 

A related take on this notion is obtained by enlarging the set of partitions of [0,T] 2 over which 
the supremum is taken in (|2 .4[) . Recall from |15] that a rectangular partition of the square [0, T] is 
a collection {Ai : i e /} of rectangles of the form Aj = [si,ti] x [ui,Vi], whose union equals [0,T] 
and which have pairwise disjoint interiors. The collection of rectangular partitions is denoted 
^rec([0, T 1 ] 2 ), and R is said to have controlled p-variation if 



I R\ p 

I lp-var;[0,T] 2 



sup Yl 

{A !:4 G/}eP r cc([0,T] 2 ) itj 
Ai = [si,ti]x[in,vi] 



R 



Ui,VJ 



< 00. 



(2.5) 



We obviously have V p (R; [0, T] 2 ) < \R\ p _ val .[ T ]2, and it is shown in [TS] that for every e > there 



exists c Pi£ such that \R\ 



p-var;[0,T] s 



< C, 



P ,eV fl+ t(R;[0,T] 2 



The main advantage of the quantity (|2.5[) 
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compared to (|2.4|) is that the map 

[a,t] x [u,v] ^ \R\ P p . vai;Mx[uM 
is a 2D control in the sense of |15j . 

Definition 2.3. Given 1 < p < 2, we say that R has finite (two-dimensional) Holder-controlled 
p-variation if V p (R; [0, T] x [0, T]) < oo, and if there exists C > such that for all < s < t < T 
we have 

V p (R;[s,t] x M) < C(t-s) 1/p . (2.6) 

Remark 2.4. This is (essentially) without loss of generality compared to Definition \2.2i To see 
this, we note that if R also has controlled p-variation in the sense of (2.5}) . then we can introduce 
the deterministic time-change r : [0,T] — > [0,T] given by r = ct" 1 , where a : [0, T] — > [0,T] is the 
strictly increasing function defined by 

T\RV 



' p-var",[0,t] 
*p-var;[0,T] 2 

It is then easy to see that R, the covariance function of X = X or, is Holder- controlled in the sense 



of Definition ULB 

Two important consequences of assuming that R has finite Holder-controlled p-variation are: (i) 
X has 1/p— Holder sample paths for every p > 2p, and (ii) by using [T^l Thm 15.33] we can deduce 
that 



E 



exp 



(?7l|X||i /p;[0iT] )] < oo for some r, > 0, (2.8) 



i.e ||X||^ p .j 0T j has a Gaussian tail. 

The mere existence of this lift is unfortunately not sufficient to apply the usual concepts of 
Malliavin calculus. In addition, it will be important to require a complementary (Young) regularity 
of the sample paths of X and the elements of its Cameron-Martin space. The following assumption 
captures both of these requirements. 

Condition 1. Let (X t ) tl£ ^ T j = {X}, ■ ■ ■ ,X^ t ^ Q _ be a Gaussian process with i.i.d. components. 
Suppose that the covariance function has finite Holder- controlled p-variation for some p € [1,2). 
We will assume that X has a natural lift to a geometric p-rough path and that %, the Cameron- 
Martin space associated with X , has Young- complementary regularity to X in the following sense: 
for some q > 1 satisfying l/p+l/q> 1, we have the continuous embedding 

H <-> C q ' var ([0, T] , R d ) . 

The following theorem appears in [14] as Proposition 17 (cf. also the recent note [15]): it shows 
how the assumption V p (R; [0, T] ) < oo allows us to embed H, in the space of continuous paths 
with finite p variation. The result is stated in [14| for one-dimensional Gaussian processes, but the 
generalisation to arbitrary finite dimensions is straightforward. 

Theorem 2.5 ( [ 1 4 j ) . Let (X t ) te ^ T ^ = (X^, . . . , Xf^ t ^ Q T , be a mean-zero Gaussian process with 
independent and identically distributed components. Let R denote the covariance function of (any) 
one of the components. Then if R is of finite p-variation for some p G [1,2) we can embed % in 
the space C p ' var ([0, T] , R d ) , in fact 

\hL > Np - t ""' ;[0 ' T1 (2.9) 

'* " jV p {R; [0,T]x [0,T]) 
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Remark 2.6 ([IS])- Writing T-L H for the Cameron- Martin space of fBM for H in (1/4,1/2), the 
variation embedding in |13) gives the stronger result that 

rl H ^ C q - var ([0,T],R d ) for any q > (H + 1/2)" 1 . 

Theorem l2.5l and Remark l2.6l providc sufficient conditions for a process to satisfy the fundamental 
Condition [TJ which we summarise in the following remark. 

Remark 2.7. As already observed, the requirement that R has finite 2D p-variation, for some 
p G [1,2), implies both that X lifts to a geometric p-rough path for all p > 2p and also that 
H C p ~ var ([0, T] , R d ) (Theorem \2.5\) . Complementary regularity ofH in the above condition thus 
can be obtained by p G [1,3/2), which covers for example BM, the OU process and the Brownian 
bridge (in each case with p = 1). When X is fBm, we know that X admit a lift to Gfl p (R d ) if 
p > 1/H, and Remark \2.6\ therefore ensures the complementary regularity of X and H if H > 1/4. 



3. Statement of the main theorem 

We will begin the section by laying out and providing motivation for the assumptions we impose 
on the driving Gaussian signal X. We will then end the section with a statement of the central 
theorem of this paper, which is a version of Hormander's Theorem for Gaussian RDEs. First, we 
give some notation which will feature repeatedly. 

Notation 1. We define 

F a ,b ■= o- (X v y : a < v < v' < b) 
to be the a-algebra generated by the increments of X between times a and b. 

The following condition aims to capture the non-degeneracy of X, it will feature prominently in 
the sequel. 

Condition 2 (non-determinism-type condition). Let (X t ) t ^ r i be a continuous Gaussian process. 
Suppose that the covariance function R of X has finite Holder- controlled p-variation for some p in 
[1,2). We assume that there exists a > such that 

nJ^r TT~^ Var C^l^o." v ^ > °" 

0<s<t<T — S ) 

Whenever this condition is satisfied we will call a the index of non- determinism if it is the 
smallest value of a for which \3.1\) is true. 

Remark 3.1. It is worthwhile making a number of comments. Firstly, notice that the conditional 
variance 

Ww{X s>t \F Q , s y F t , T ) 

is actually deterministic by Gaussian considerations. Then for any [s,t] C [0,S] C [0,T], the law 
of total variance can be used to show that 

Var (X St t\Fo, s V F t , s ) > Var (X s , t \F 0)S V T t , T ) ■ 

It follows that if LS. 1)) holds on [0,T], then it will also hold on any interval [0,5] C [0, T] provided 
S > 0. 
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Note that Condition [2] implies the existence of c > such that 

Var (X Stt \Fo, s V F t , T ) >c(t-s) a . 

This is reminiscent of (but not equivalent to) other notions of non-determinism which have been 
studied in the literature. For example, it should be compared to the similar notion introduced 
in [3j, where it was exploited to show the existence of a smooth local time function (see also the 
subsequent work of Cuzick et al. [9] and [10]). In the present context, Condition [2] is also related 
to the following condition: for any / of finite p- variation over [0, T] 

[ f s dh s =0 VheH => / = a.e. on [0,T]. (3.2) 
Jo 

This has been used in [5] to prove the existence of the density for Gaussian RDEs. In some sense, 
our Conditionals the quantitative version of (|3.2j) . In this paper when we speak of a non-degenerate 
Gaussian process {X t ) t ^ T i we will mean the following: 

Definition 3.2. Let (Xt) t€ ^ T i be a continuous, real-valued Gaussian process. If D = {ti : i = 0, 1, . . 
is any partition of [0,T] let (Qfj)i<ij< n denote the nx n matrix given by the covariance matrix of 
the increments of X along D, i.e. 



Q %= R WZl)- (3 - 3) 

We say that X is non- degenerate if Q D is positive definite for every partition D of [0,T]. 

Remark 3.3. An obvious example of a 'degenerate' Gaussian process is a bridge processes which 
return to zero in [0,T]. This is plainly ruled out by an assumption of non- degeneracy. 

It is shown in [7] that non-degeneracy is implied by (|3 . 2|) . Thus non-degeneracy is a weaker 
condition than (|3.2j) . It also has the advantage of being formulated more tangibly in terms of 
the covariance matrix. The next lemma shows that Condition [2] also implies that the process is 
non-degenerate. 

Lemma 3.4. Let (X t ) t£ ^ T ^ be a continuous Gaussian process which satisfies Condition^ then X 
is non- degenerate. 

Proof. Fix a partition D of [0, T], and denote the covariance matrix along this partition by Q with 
entries as in (|3.3[) . If Q is not positive definite, then for some non-zero vector A = (Ai, . . . , A„) G K™ 
we have 

\ 2 "1 



= X Q\ = E 



XiXt il .t i 



(3.4) 



Suppose, without loss of generality, that Xj ^ 0. Then from fl3.4[) we can deduce that 

- K 

X t] _ l ,t j = ^ -^-x ti _ ltti 

with probability one. This immediately implies that 

V8x{X tj _ ljtj \F , tj _ 1 V^, T ) =0, 
which contradicts (13.11). □ 
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A crucial step in the proof of the main theorem is to establish lower bounds on the eigenvalues of 
the Malliavin covariance matrix in order to obtain moment estimates for its inverse. In the setting 
we have adopted, it transpires that these eigenvalues can be bounded from below by some power 
of the 2D Young integral: 

f f s f t dR(s,t), (3.5) 

J[0,T] 2 

for some suitable (random) function / G (jp-vnr Qo, T],M d ). By considering the Riemann sum 
approximations to (|3.5j) . the problem of finding a lower bound can be re-expressed in terms of solving 
a sequence of finite-dimensional constrained quadratic programming problems. By considering the 
dual of these problems, we can simplify the constraints which appear considerably; they become 
non-negativity constraints, which are much easier to handle. Thus the dual problem has an explicit 
solution subject to a dual feasibility condition. The following condition is what emerges as the limit 
of the dual feasibility conditions for the discrete approximations. 

Condition 3. Let (-Xt)teroT] ^ e a continuous, real-valued Gaussian process. We will assume that 
X has non-negative conditional covariance in that for every [u,v] C [s,t] C [0,5] C [0,T] we have 

Cov(X 3 , t ,X UtV \F 0!S V T t ,s) > 0. (3.6) 

In Section [S] we will prove a novel interpolation inequality. The significance of Condition [3] will 
become clearer when we work through the details of that section. For the moment we content 
ourselves with an outline. Firstly, for a finite partition D of the interval [0,T] one can consider 
the discretisation of the process X t conditioned on the increments in D<~) ([0, s] U [t, T}). Let Q D be 
the corresponding covariance matrix of the increments (see p.3|) ). Then the conditional covariance 
Cov (X® t , X^.J\Fq s V ^ r ( D T ) of the discretised process can be characterised in terms of a Schur 
complement £ of the matrix Q D . Utilising this relation, the condition 

Cov(X° t ,X° v \T Q D s VT° T ) >0 

is precisely what ensures that the row sums for S are non-negative. Conversely, if for any finite 
partition D all Schur complements of the matrix Q D have non-negative row sums Condition [3] is 
satisfied. This relation motivates an alternative sufficient condition that implies Condition [3j which 
has the advantage that it may be more readily verified for a given Gaussian process. In order to 
state the condition, recall that an n x n real matrix Q is diagonally dominant if 

Qu>^2\Qij\ for every i € {1,2,..., n}. (3.7) 

Condition 4. Let (X t ) te ^ Q T j be a continuous real-valued Gaussian process. For every [0, 5] C [0, T] 
we assume that X has diagonally dominant increments on [0,5]. By this we mean that for every 
partition D = {ti : i = 0, 1, . . . , n} of [0, 5], the n x n matrix {Qfj) 1Ki - <n with entries 

Qtj = E [Xu-uU x tj-ut,] =R { tjllft] ) 

is diagonally dominant. 

Diagonal dominance is obviously in general a stronger assumption than requiring that a covari- 
ance matrix has positive row sums. Consequently, Condition [4] is particularly useful for negatively 
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correlated processes, when diagonal dominance of the increments and positivity of row sums are 
the same. The condition can then be expressed succinctly as 

E [X , s X s , t ] > V [a, t] C [0, S] C [0, T] . 

In fact, it turns out that Condition |4] implies Condition |3l This is not obvious a priori, and 
ultimately depends on two nice structural features. The first is the observation from linear algebra 
that the property of diagonal dominance is preserved under taking Schur complements (see |32j 
for a proof of this). The second results from the interpretation of the Schur complement (in the 
setting of Gaussian vectors) as the covariance matrix of a certain conditional distribution. We will 
postpone the proof of this until Section [5] when these properties will be used extensively. 

The final condition we will impose is classical, namely Hormandcr's condition on the vector fields 
defining the RDE. 

Condition 5 (Hormander). We assume that 

span{V u ...,V d , [Vi,Vj] , [V h [V h V k ]} : i, j, k, . . , = 0, 1, . . . , d} \ yo = T ya M. e = K e . (3.8) 

We are ready to formulate our main theorem. 

Theorem 3.5. Let (Xt) te ^ T ^ = [Xl, . . . , Xf) t ^ Q T j be a continuous Gaussian process, with i.i.d. 
components associated to the abstract Wiener space (W, %, p). Assume that some (and hence every) 
component of X satisfies: 

(1) Condition^ for some p <E [1,2); 

(2) Condition^ with index of non- determinancy a < 2/p; 

(3) Condition^ i.e. it has non-negative conditional covariance. 

Fix p > 2p, and let X eGfi p (K. d ) denote the canonical lift of X to a Gaussian rough path. 
Suppose V = (Vi, . . . , Vd) is a collection of C°° -bounded vector fields on K e , and let (Y t ) te ^ T i be 
the solution to the RDE 

dY t = V(Y t )dX t + Vo(Y t )dt, Y(0) = y . 

Assume that the collection (Vq, Vi, . . . , Vd) satisfy Hormander's condition, Condition^ at the start- 
ing point yo. Then random variable Y t has a smooth density with respect to Lebesgue measure on 
R e for every t G (0, T] . 

4. Examples 

In this section we demonstrate how the conditions on X we introduced in the last section can be 
checked for a number of well known processes. We choose to focus on three particular examples: 
fractional Brownian motion (fBm) with Hurst parameter H > 1/4, the Ornstein Uhlenbeck (OU) 
process and the Brownian bridge. Together these encompass a broad range of Gaussian processes 
that one encounters in practice. Of course there are many more examples, but these should be 
checked on a case-by-case basis by analogy with our presentation for these three core examples. We 
first remark that Conditions [1] is straight forward to check in these cases (see e.g. [12] and [7]). We 
will therefore commence with a verification of the non-determinism condition i.e. Condition [5] 

4.1. Non-determinism-type condition. Recall that the Cameron-Martin space W is defined to 
be the the completion of the linear space of functions of the form 

n 

ai R (ti, •) , at e R and t t e [0, T] , 

i=l 
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with respect to the inner product 



y^aiR(ti, •) , bjR (sj, ■)) — X! a ibjR (*< 

j = l I nj 1=1 j = l 



i i=l 



Some authors prefer instead to work with the set of step functions £ 

£ = \Y j a t l%u] ■■a i €R,t i e [0,T]l, 



equipped with the inner product 



(l[0,t]) l[o,«])a/ - R (M) ■ 



If H denote the completion of £ w.r.t (•,•)•«> then it is obvious that the linear map <p : £ — > % 
defined by 

<t>{l m )=R{t,-) (4.1) 

extends to an isometry between T~L and T-L. We also recall that H is isometric to the Hilbert space 
H 1 (Z) C L 2 P) which is defined to be the 



\L 2 (Q) 



■closure of the set: 



{X]" =1 aiZu ■ ^ € M, U e [0, T] , n e N} . 

In particular, we have that 1 1 [0,4] | ^ = | Z t \ L i ^ . We will now prove that Condition[5]holds whenever 

it is the case that H embeds continuously in L q ([0,T]) for some q > 1. Hence, Condition [2] will 
simplify in many cases to showing that 



\h\Li[o,T] < C\h\^, 



for some C > and all h £ H. 



Lemma 4.1. Suppose (Zt) tg j 0T j is a continuous real-valued Gaussian processes. Assume that for 
some q > 1 we have H. ^L q ([0,T]). Then Z satisfies Condition^ with index of non- determinancy 
less than or equal to 2/q; i.e. 

1 



inf 

0< s <t<T {t- S f ,q 



Var(Z M |Jo, s V T t , T ) > 0. 



Proof. Fix [s,t] C [0,T] and for brevity let Q denote the cr-algebra J-"o iS V Tt,T- Then, using the 
fact that Var(Z Sji |C/) is deterministic and positive, we have 



Var(Z,, t |0) = || Var(Z s , t \Q)\\ 



= E 



E 



(z s , t -E{z s , t \g]) 2 \g 



1/2 



E 



(z s , t -E[z S!t \g}) 2 = \\z a , t -E[z a , t \g]\\ 



L 2 (0) 



inf 



\Z s .t Y\\ L2 ^ 



We can therefore find sequence of random variables (Y n )°° =1 C L 2 (fi, , P) such that 



l s ,t Y n \\ L 2( ) — E 



(z„ tt -Y n y iVaT(z Stt \g). 



(4.2) 
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Moreover because E [Z Sjt \G] belongs to the closed subspace H 1 (Z), we can assume that Y n has the 
form 



for some sequence of real numbers 
and a collection of subintervals 



K 
{[«?.< 



i+U 



1. 



i &n } 



which satisfy [i™, C [0, s] U [s, T] for every n G N. 

We now exhibit a lower bound for \\Z s . t — YnW^fm which is independent of n (and hence from 

(|4.2[) will apply also to Var (Z St t\Q)). Let us note that the isometry between the H 1 (Z) and T-L 
gives that 

\\Z s .t — Y n 

where 



I 2 



l 2 ~ 



(4.3) 



The embedding "H °->L 9 ([0,T]) then shows that 

> c\Ml«[0,T] > C(t-S) 

The result follows immediately from this together with (14.21) and (|4.3[) . 



2/9 



□ 



Checking that embeds continuously in a suitable L q ([0, T]) space is something which is readily 
done for our three examples. This is what the next lemma shows. 

Lemma 4.2. If(Z t ) t£[QT] is fBm with Hurst index H £ (0, 1/2) and q G [1,2) thenU ^ L« ([0,T]). 
7/(Z t ) tg j 0T j is i/ie (centred) Ornstein-Uhlenbeck process or the Brownian bridge (returning to zero 
after time T) then H <-> L 2 ([0,T]). 

Proof. The proof for each of the three examples has the same structure. We first identify an 
isometry K* which maps T~L surjectively onto L 2 [0, T]. (The operator K* is of course different for 
the three examples). We then prove that the inverse (K*)~ is a bounded linear operator when 
viewed as a map from L 2 [0,T] into L q [0,T]. For fBm this is shown via the Hardy-Littlewood 
lemma (see [3U| )■ For the OU process and the Brownian bridge it follows from a direct calculation 
on the operator K* . Equipped with this fact, we can deduce that 



Li[0,T] 



(K*Y K*h 



< 



L"[0,T] 



(K* 



K*h 



L 2 [0,T] 



(K*y 



\h\u, 



which concludes the proof. 



□ 



As an immediate corollary of the last two lemmas we can conclude that the (centred) Ornstein- 
Uhlenbeck process and the Brownian bridge (returning to zero after time T) both satisfy Condition[2] 
with index of non-determinism no greater than unity. In the case of fBm (^^)te[o,Tl the scaling 
properties of Z H enable us to say more about the non-determinism index than can be obtained by 
an immediate application of Lemmas 14.11 and 14.21 To sec this, note that for fixed [s,t] C [0, T] we 
can introduce a new process 



-H 



7 H 
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Z defines another fBm, this time on the interval [0, T (t — s) 1 ] =: [0, T}. Let u = s (t — s) 1 , v = 
t (t — s)~ and denote by P a ,b the er-algebra generated by the increments of Z in [a, b}. Scaling then 
allows us to deduce that 

Var(Z a , t |7b, a V T t , T ) = (* - s) 2H Vsx{Z u , v \ T . u V F v t ). (4.4) 

By construction u — v = 1. And since Z is fBm it follows from Lemmas 14.11 and 14 . 2 1 that 

inf Vax(Z u , v \Fo !U V^ f )> 0. (4.5) 

[u,v]C[0,T], 
\u — v\ — l 

It follows from (|4.4j) and (J4T5J) that Z w satisfies Condition [2] with index of non-determinancy no 
greater than 2H. 

4.2. Non-negativity of the conditional covariance. We finally verify that our example pro- 
cesses also satisfy Condition [3] We first consider the special case of process with negatively corre- 
lated increments. 

4.2.1. Negatively correlated increments. From our earlier discussion, it suffices to check that Condi- 
tion [4] holds. In other words, that Q D is diagonally dominant for every partition D. This amounts 
to showing that 

E [Z ti _ uU Zo, T ] > 

for every < < U <T. It is useful to have two general conditions on R which will guarantee 
that (i) the increments of Z are negatively correlated, and (ii) diagonal dominance is satisfied. Here 
is a simple characterisation of these properties: 
Negatively correlated increments: If i < j, write 

Qij = E [Z ti _ ltti Z tj _ utj ] = f I' d 2 ab R(a, b) dadb, 

so that a sufficient condition for < is d^ b R(a, b) < for a < b. This is trivially verified for 
fBm with H < 1/2. Note that d^ b R(a, b) might have some nasty singularities on the diagonal, but 
the diagonal is avoided here. 

Diagonal dominance: If we assume negatively correlated increments, then diagonal dominance is 
equivalent to y\ Qy > 0. Moreover, if we assume Zq is deterministic and Z is centred we get 

V Qij = E [Z t ._ uti Z T ]= [' d a R(a,T) da, 

so that a sufficient condition for • Qij > is d a R(a, b) > for a < b. This is again trivially 
verified for fBm with H < 1/2. 

Example 4.3. In the case where (Z t ) te ^ 0T ^ is the Brownian bridge, which returns to zero at time 
T" > T we have 

R (a, b)=a (T' - b) , for a < b. 

It is then immediate that d a 2 b R{a,b) = —1 < and d a R{a,b) = T' — b > 0. Similarly, for the 
centred Ornstein- Uhlenbeck process we have 

R(a, b) = 2e~ b sinh(a) , fora<b. 

From which it follows that d a 2 b R{a, b) = ~2e~ b cosh (a) < and d a R(a, b) — 2e~ b cosh (a) > 0. 
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4.2.2. Without negatively correlated increments. In the three examples we were able to check Con- 
dition [3] by using the negative correlation of the increments and showing explicitly the diagonal 
dominance. In the case where the increments have positive or mixed correlation we may have to 
check the weaker condition, Condition [3l directly. An observation that might be useful in this 
regard is the following geometrical interpretation. Recall that we want to want to check that 

Cov(Z M ,Z„.,|Jo, s V JF i)T ) > 0. 

For simplicity, let X = Z s> t, Y = Z U)V and Q = J-"o, s V Tt,T- The map Pg : Z M> E [Z\Q] then defines 
a projection from the Hilbert space L 2 (fi, J-,P) onto the closed subspacc L 2 (ft, Q, P), which gives 
the orthogonal decomposition 

l 2 (ft, t, p) = l 2 (ft, g, p) e l 2 (ft, g, P) x . 

A simple calculation then yields 

Cov (A, Y\g) = E [Cov (X, Y\g)] =E[(I-P g )X(I- Pg)Y] = {P^X, P^Y) La(a) , 

where Pg is the projection onto L 2 (ft, Q, P) ■ In other words, Cov (A, Y\Q) > if and only if 
cos# > 0, where 9 is the angle between the projections PgX and PgY of, respectively, X and Y 
onto the orthogonal complement of L 2 (tt,g,P). 

5. A Norris-type Lemma 

In this section we generalise a deterministic version of the Norris Lemma, obtained in |21j for p 
rough paths with 1 < p < 3, to general p > 1. It is interesting to note that the assumption on the 
driving noise we make is consistent with |21j . In particular, it still only depends on the roughness 
of the basic path and not the rough path lift. 

5.1. Norris 's lemma. To simplify the notation, we will assume that T = 1 in this subsection; all 
the work will therefore be done on the interval [0, 1]. Our Norris type lemma relies on the notion 
of controlled process, which we proceed to define now. Recall first the definition contained in |16j 
for second order rough paths: whenever x <G C° n ([0, 1]; G N (M. d )) with 7 > 1/3, the space Q X (K) 
of controlled processes is the set of functions y £ C 7 ([0, 1];K) such that the increment y s t can be 
decomposed as 

Vet = y\x\,t+ r s,t, 

where the remainder term r satisfies |r Sj t| < c y \t — s| 27 and where we have used the summation over 
repeated indices convention. Notice that y has to be considered in fact as a vector (y, y 1 , . . . , y d ). 

In order to generalize this notion to lower values of 7, we shall index our controlled processes by 
words based on the alphabet {1, . . . , d}. To this end, we need the following additional notations: 

Notation 2. Let w = (ix, ■ ■ ■ ,i n ) cind w = (jx, ■ ■ ■ ,j m ) be two words based on the alphabet 
{1, . . . ,d}. Then \w\ = n denotes the length ofw, and ww stands for the concatenation (ii, . . . ,i n , 
ji, . . . ,j m ) of w and id. For L > 1, Wl designates the set of words of length at most L. 

Let us now turn to the definition of controlled process based on a rough path: 

Definition 5.1. Let x E C°' 7 ([0, l];G N (R d )), with 7 > 0, N = [1/7]. A controlled path based 
on x is a family (y w ) w ew N - 1 indexed by words of length at most N — 1, such that for any word 
w € Wjv-2 we have: 

Vlt= E vTKt+rlu ™here |r£| < c v \t - S |( w -Mb (5>1 ) 
tBeWjv_i_| t „| 
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In order to take the drift term of {TTT} into account, we also assume that for w = we get a 
decomposition for the increment y s t of the form 



y., t = E y?xtt+y s (t-s)+r y st , where |r* t | < Cy\t - s\ N \ 



(5.2) 



The set of controlled processes is denoted by QJ, and the norm on is given by 

Ni<K = iiy ii7+ E m- 

We next recall the definition of 9- Holder- roughness introduced in [21] . 

Definition 5.2. Let S 6 (0,1). A path x : [0, T] — > W 1 is called 9-H6lder rough if there exists a 
constant c > such that for every s in [0,T] ; every e in (0,T/2], and every cf> in R d with \<j>\ = 1, 
there exists t in [0,T] such that e/2 < \t — s\ < e and 

\(4>,x s , t )\>ct 9 . 

The largest such constant is called the modulus of 9 -Holder roughness, and is denoted Lg (x). 

A first rather straightforward consequence of this definition is that if a rough path x happens to 
be Holder rough, then the derivative processes y w in the decomposition ()5.1[) of a controlled path 
y is uniquely determined by y. This can be made quantitative in the following way: 

Proposition 5.3. Let x G C°' 7 ([0, 1]; G N (R d )), with 7 > and N = [1/7]. We also assume that 
x is a 6 -Holder rough path with 6 < 27. Let y be a H-valued controlled path defined as in Definition 
I5.il and set y n {y) = su V\ w \=n II 2/™ II 00 ■ Then there exists a constant M depending only on d such 
that the bound 

' N—n g „ \ 

1+ E (lly'Hloo||x% ffi | 7 )'-'-'+ ]T 



y n {y) < 



M [y n -i{y)] l -% 



Lg(x) 



W\\ (iV-n + l)-, 

ll(A-n+l)7 



(5.3) 



\ |ui|=2 \w\=n-l 
holds for every controlled rough path an d every 1 < n < N — 1. 



Proof. For sake of clarity we shall assume that y° = 0, leaving to the patient reader the straight- 
forward adaptation to a non vanishing drift coefficient. Now start from decomposition (|5.1|) . and 
recast it as 

d 

vlt = E y» ix< & + E y^<t + 

j=l 2<|tD|<JV-l-|tu| 

where we have set wj for the concatenation of the word w and the word (J) for notations! sake. 
This identity easily yields 



sup 

|t-«|<e 



E^ x 2 



3=1 



<2||y^ 



E 

2<|u)|<JV-l-|to| 



y^Hoollx^ll^iel*!^ 



+ \\ rW \\j(N-\w\) 



(N-\iv\)~f 



(5.4) 

Since x is 9- Holder rough by assumption, for every j < d, there exists v = v(j) with e/2 < \v — s\ < e 
such that 

d 

' >Le{x)s \(y:\...,y: d )\. (5.5) 



E^ x S 
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Combining both (j5.4[) and (|5.5p for all words w of length n — 1, we thus obtain that 

c 



3Uy) < 



3 ; n-i(y)e" 



+ sup ^ ||y wffi || 00 ||x*|| 7N£ l*^- e + ||r w || 7( ^ Hl „| )£ ^-l' i 'l)T'- e 

I™!-' 1 \2<|w|<JV-l-|iu| > 

One can optimise the right hand side of the previous inequality over e, by choosing e such that the 
term y n -i{y) £~ S is of the same order as the other ones. The patient reader might verify that our 
claim (|5.3[) is deduced from this elementary computation. □ 

Remark 5.4. Definition ] 5. 1\ and Provosition \5.S\ can be generalized straightforwardly to d- dimensional 
controlled processes. In particular, if y is a d dimensional path, decomposition (|5.2[) becomes 

VU= E vi' W <t+rl% where |r#| < c y \t - s\ N \ (5.6) 

for all i = 1, . . . ,d. 

We now show how the integration of controlled processes fits into the general rough paths theory. 
For this we will use the non-homogeneous norm A/" x , 7 = A/" X;7 ,[o,i] introduced in (|2.3[) . 

Proposition 5.5. Let y be a d-dimensional controlled process, given as in Definition \5.1\ and whose 
increments can be written as in (|5.6[) . Then (x, y) is a geometrical rough path in G N ($L 2d ). In 
particular, for (s,t) £ A 2 , the integral I s t = / y l s dx\ is well defined and admits the decomposition 

J.,t = £(tf'<,+ E Vt<i)+<t, (5-7) 
3=i tuew N -i 

w/iere |rf jt | < AA x ||y|| 7 |i - s|( Ar+1 ^. 

Proof. Approximate x and y by smooth functions x m ,y m , while preserving the controlled process 
structure (namely y m £ Q x m )- Then one can easily check that (x m ,y m ) admits a signature, and 
that JJ" t = / y™' 1 dx™' 1 can be decomposed as (|5.7[) . Limits can then be taken thanks to [17], 
which ends the proof. 

□ 

The following theorem is a version of Norris' Lemma, and constitutes the main result of this 
section. 

Theorem 5.6. Let x be a geometric rough path of order N > 1 based on the W 1 -valued function x. 
We also assume that x is a O-Holder rough path with 2-f > 6. Let y be a W 1 -valued controlled path 
of the form given in Definition \5.1{ b £ C 7 ([0, 1]), and set 

ii 



z t 



E / vl dx l + b s ds = I st + b s ds 
i=1 Jo Jo Jo 



Then, there exist constants r > and q > such that, setting 

K = l + Lo( x y 1 +Af xn + \\y\\ QZ + \\b\\ c , , (5.8) 

one has the bound 

\\y\\oo + \\b\\oc < Mil* WzW^ , 
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for a constant M depending only on T , d and y. 

Proof. We shall divide this proof in several steps. In the following computations, k will designate a 
certain power for 1Z and M will stand for a multiplicative constant, whose exact values are irrelevant 
and can change from line to line. 

Step 1: Bounds on y. Combining (|5.7j) . the bound on r 1 given at Proposition 15 . 51 and the definition 
of 1Z, we easily get the relation 

|N|oo<Affc\ 

We now resort to relation (|5.3I) applied to the controlled path z and for n = 1, which means that 
y n (z) x \\y\\oa and y n -i(z) x Halloo- With the definition of 7Z in mind, this yields the bound 

||y||oo < M\\z\C^n\ (5.9) 

which corresponds to our claim for y. 

Along the same lines and thanks to relation (|5.3|) for n > 1, we iteratively get the bounds 

y n {y) < M \\z\\t~^ r 1l\ (5.10) 
which will be useful in order to complete the bound we have announced for b. 

Step 2: Bounds on r 1 and I . In order to get an appropriate bound on r, it is convenient to consider 
x as a rough path with Holder regularity f3 < 7, still satisfying the inequality 2/3 > 9. Notice 
furthermore that Af x ^ < Af xn . Consider now w <G W n . According to (|5.10[) we have 

iiznioo^MHziii^V, 

while || y 10 1| 7 < M1Z by definition. Hence, invoking the inequality 

\\y w h<ny w \\h\y w C^ , 

which follows immediately from the definition of the Holder norm, we obtain the bound 

\\y w h<M\\z\\L ^ ( 

which is valid for all w £ W ra and all n < N — 1. Summing up, we end up with the relation 

\\yh <M\\z\\L ^ 1 ^K K . 

Now according to Proposition 15. 5[ we get rj, t < Af Xt p\\y\\/3\t — s\^ N+1 ^ and the above estimate 
yields 

||rV+i)£< M INI~ { -- ] n\ 

Plugging this estimate into the decomposition (|5.7[) of / st we end up with 

\\I\\oo<M\\z\\oo 2 -' > ^n*. (5.11) 

Step 3: Bound on b. According to the bound (|5.11|) we have just obtained, we obviously have 



b s ds 







<M\\z\\ x v -""R K . 

Once again we use an interpolation inequality to strengthen this bound. Indeed, we have 

llft/Hoo < MH/IU max(i \\f\W^\\d t f\\$ 
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and applying this inequality to f t = J* b s ds, it follows that 

fl o ) N - 1 (i i )( ~* ) 

\\b\\oo < M\\z\\L 2 ^ ( ^ > ^+ 1> n K . (5.12) 
Gathering the bounds (|5.9[) and (|5 . 12[) , our proof is now complete. □ 

5.2. Small-ball estimates for L# (X). We now take X to be a Gaussian process satisfying Con- 
dition [5] As the reader might have noticed, equation (|5.8I) above involves the random variable 
Lg (X) , for which we will need some tail estimates. The non-determinism condition naturally 
gives rise to such estimates as the following lemma makes clear. 

Lemma 5.7. Suppose (-Xt)te[0T] is a zero-mean, R d -valued, continuous Gaussian process with 
i.i.d. components, with each component having a continuous covariance function R. Suppose that 
one (and hence every) component of X satisfies Condition^ Let ao > be the index of non- 
determinism for X and suppose a > aQ. Then there exist positive and finite constants C\ and C2 
such that for any interval Is C [0, T] of length 5 and < x < 1 we have 

P ( inf sup I (0, X st )\ < x] < Ci exp f-C 2 (5x" 2/a ) . (5.13) 
VM =1 s,teh / ^ ' 

Proof. The proof is similar to Theorem 2.1 of Monrad and Rootzen [55|; we need to adapt it because 
our non-determinism condition is different. 

We start by introducing two simplifications. Firstly, for any <j) in R with \<j)\ = 1 we have 

((^^)) te[ 0,T] £ (^ 1 ) t6[0 ,T]' ( 5 - 14 ) 

which implies that 

P ( sup \{<f>,X„ lt )\ < x) = P ( sup 1*^1 < x) . (5.15) 
\s.tei s J \s,tei s J 

We will prove that the this probability is bounded above by 

cxp (^— c6x 2 ^ c 

for a positive real constant c, which will not depend on T, 8 or x. The inequality (|5. 13[) will then 
follow by a well-known compactness argument (see [21] and [29]). The second simplification is to 
assume that 6 = 1. We can justify this by working with the scaled process 

x t = s a / 2 x t/s , 

which is still Gaussian process only now parametrised on the interval [0, T] := [0, T6]. Furthermore, 
the scaled process also satisfies Condition [2] since 



Var (X a ,t\Fo,. V F tt f) = S a Var (X s/S j /s \F a , s/s V F t/SiT ) 

>cS a (t^-Y = c (t-s) a . 

Thus, if we can prove the result for intervals of length 1, we can deduce the bound on (|5.15| wc 
want from the identity 



pf sup \Xl t \<x) = p( sup IX^I 
\s,t€i s J \s,teh 



< 



<W 2 
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To conclude the proof, we begin by defining the natural number n := \x ~ 2 / a \ > 1 and the 
dissection D (I) = {tj : i = 0, 1, . . . , n + 1} of J = I\, given by 

U = inf I + ix 2/a , i = Q, l,...,n 
t„ + i = inf I + 1 = sup /. 

Then it is trivial to see that 



P (sup \xl t \ <x)<P ^maxJK-uu\ ^ x ) 



(5.16) 



To estimate (|5.16|) we successively condition on the components of 

More precisely, the distribution of Xl n _ x tn conditional on (X^ tl , . . . ,Xj n _ 2 j n _ 1 ) is Gaussian with 
a variance a 2 . Condition [5] ensures that a 2 is bounded below by cx 2 . When Z is a Gaussian random 
variable with fixed variance, P (\Z\ < x) will be maximised when the mean is zero. We therefore 
obtain the following upper bound 

r x/a l ( \ ^ 



' —X 1 

Using xja < y/c, we can finally deduce that 

/ \ / Q x -2/a 

P I sup \Xl t \ < x I < exp (— Cn) < cxp I 

where C := log [2$ (y^) — l] -1 e (0,oo). □ 

Corollary 5.8. Suppose (X t ) t ^ ^ is a zero-mean, WL d -valued, continuous Gaussian process with 
i.i.d. components satisfying the conditions of Lemma \5.7\ Then for every 9 > a/2, the path 
(X t ) t6 j 0T j is almost surely 9-Hdlder rough. Furthermore, for < x < 1 there exist positive finite 
constants C\ and Ci such that the modulus of 6 -Holder roughness, Lg (X), satisfies 



P {Lg {X) < x) < d exp -C 2 x 



,-2/a 



Ln particular, under these assumptions we have that Lg (X) 1 is in f] p> oL p (f2). 

Proof. The argument of [H] applies in exactly the same way to show that Lg (X) is bounded below 
by 

where 

D e (X):= inf inf inf sup KMfi^i 

H0ll=i n>i fe<2« s ,tei k , n (2-™T) 

and L k . n := [(k - 1) 2""T, k2~ n T]. We can deduce that for any x £ (0, 1) 

\(<f>,X. 

r i uu sup 

\ IUII = 1 „ tr - r. 

n=l k=l 



P(Dg(X)<x)<yyP inf sup 1 <x 

- 1 11011=1 s ,t G £,„ (2-™T) 9 
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whereupon we can apply Lemma 15.71 to yield 

oo 

P {Dg (X) < x) < ci 2 ™ ex P ^-c22- n{1 - 28/a} T' 29/a x- 2/a 



By exploiting the fact that 8 > a/2, we can then find positive constants C3 and C4 such that 
P {Dg (A) <x)<c 3 jr exp (-c A nx- 2 ' a \ = c CX P ("^ 2/ ") 

< C5 exp ^ — C4X ^' 

which concludes the proof. □ 

6. An interpolation inequality 

Under the standing assumptions on the Gaussian process, the Malliavin covariance matrix of 
the random variable U£_ (yo) = Yt can be represented as a 2D Young integral (see [7]) 

In practice, showing the smoothness of the density boils down to getting integrability estimates 
on the inverse of inf ||„||=i v t Ctv, the smallest eigenvalue of Ct- For this reason we will be interested 



m 



We will return to study the properties of Ct more extensively in Section . For the moment, we look 
to generalise this perspective somewhat. Suppose / : [0, T] — > R is some (deterministic) real- valued 
Holder-continuous function, where 7 is Young-complementary to p, 2D-variation regularity of R. 
Our aim in this section is elaborate on the non-degeneracy of the 2D Young integral 

f s f t dR(s,t). (6.2) 

[0,T] 

More precisely, what we want is to use Conditions [5] and [3] to give a quantitative version of the 
non-degeneracy statement: 



/ f B f t dR(8,t) = 0=> f = 0. 

J[0,T] 



(6.3) 



To give an idea of the type of estimate we might aim for, consider the case where R = R BM is the 
covariance function of Brownian motion. The 2D Young integral (|6.2|) then collapses to the square 
of the L 2 -norm of / : 



f s f t dR BM (s,t) 

[0,T] 



\L 2 [0,T] 

and the interpolation inequality (Lemma A3 of [20]) gives 



l/IL ;[ o,T] < ^{T- 1/2 \f\ W Af lll^ H/lli-W]) • (6-5) 
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Therefore, in the setting of Brownian motion at least, (|6.5p and (|6.4[) quantifies (|6.3[) . The problem 
is that the proof of (|6.5[) relies heavily properties of the L 2 -norm, in particular we use the fact that 



if f(u) > c> for all u e [s,t] then |/| L2[Sjt] > c (f - s) 1/2 . 

We cannot expect for this to naively generalise to inner products resulting from other covariance 
functions. We therefore have to re-examine the proof of the inequality (|6.5[) with this generalisation 
in mind. 

It is easier to first consider a discrete version of the problem. Suppose D is some (finite) partition 
of [0, T]. Then the Riemann sum approximation to (|6.2j) along D can be written as 

f(D) T Qf(D), 

where Q is the the matrix (|3.3|) and / (D) the vector with entries given by the values of / at the 
points in the partition. The next sequence of results is aimed at addressing the following question: 

Problem 6.1. Suppose \ f\ oa .t s t i > 1 for some interval [s, t] C [0, T] . Can we find a positive lower 

bound f (D) Qf (D) which holds uniformly over some sequence of partitions whose mesh tends to 
zero? 

The next lemma is the first step towards securing an answer. 

Lemma 6.2. Let (Qij) i 2 n y be a real n x n positive definite matrix and k be an integer in 
{1, ...,n}. Suppose Q has the block decomposition 

g = ( Qn Q12 \ with Q n g K", Q12 € R k - n - k , Q21 € K"" fe ' fc , Q22 € K"- fe ^- fe . 

Let S denote the Schur complement of Qn in Q, i.e. S is the (n — k) x (n — k) matrix given by 

S = Q22 — Q2\Q\\Q\2- 

Assume S has non-negative row sums: 

n—k 

S v >0 for alii e{l,...,n- k} , (6.6) 

i=i 

and b > is such that b = (b, . . . , b) is in R"~ fc . Then the infimum of the quadratic form x T Qx 
over the subset 

C = {(xi,...,x n ) : (x k+1 ,...,x n ) > h} 

is given by 

n—k 

inf x T Qx = b T Sb =b 2 ^ 

Proof. Without loss of generality we may assume that 6=1. We can then reformulate the statement 
as describing the infimal value for the following constrained quadratic programming problem: 

mmx T Qx subject to Ax > (0 R k, l K n-*.) 

where R k := (0, . . . , 0) el', lijn-fe := (1, . . . , 1) € K n ~ k and A the n x n matrix which is defined 
(in the standard basis) by 

Ax = A(xi, . . .,x n ) T = (0 K k,x k+1 , . . . ,x n ) T . 
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The Lagrangian function of this quadratic programming problem (see e.g. [3] page 215) is given by 

L(x, A) = x T Qx + X T (-Ax + (0 K h,l Mn -k)) . 

Solving for 

V x L(x, A) = 2Qx - A T \ = 
and using the strict convexity of the function we deduce that x* = 1/2Q~ 1 A T X is the minimiser of 
L. Hence, the (Lagrangian) dual function g (A) := inf^ L (x, A) is given by 

9 (A) = -]\ T AQ- 1 A T \ + X T (Oh*, l R »-») 

and the dual problem consists of 

maxg (A) subject to A > 0. 

As Q _1 is positive definite the function g is strictly concave and the local maximum A* = 2Qb > 
that is obtained by solving Va<? (A) = with 

V A5 (A) = -^AQ- 1 A T X + X T (0 R *,1 R „-*) (6.7) 
is also the unique global maximum. Writing Q _1 in block form 



Q [ (Q" x ) 21 (Q- 1 ) 



' 12 
^22 

and using the definition of A it is easy to see that the vector 

A* := ^^(Q- 1 )" 1 ^-*) (6.8) 

solves (|6.7[) . We need to check that this vector is feasible for the dual problem, since then strong 
duality holds (see e.g. [3] pages 226-227) and the optimal values for the dual and primal problems 
coincide. 

In order to check that ()6.8[) is feasible we need to show A* > 0. To do this, we first remark that 
a straight-forward calculation gives the inverse of the sub-block (Q _1 ) 22 a s the Schur complement 
of Qn in Q; that is 

(Q -1 )" 1 = S = Q22 - Q 2 lQuQl2. 
Condition (|6.6j) gives immediately that for every i £ {k + 1, . . . , n} 

n — k 

\*=2"£S {i . k)j >0 

3=1 

and hence A* > 0. It now follows from strong duality that we have 

inf x T Qx = min g(X) = l£ n - k Sl^ n -k, 

as required. □ 

Suppose now that Q arises as the covariance matrix of the increments of a Gaussian process 
along some partition. In light of the previous lemma, we need to know when the Schur complement 
of some sub-block of Q will satisfy condition (|6.6[) . In the context of Gaussian vectors, these 
Schur complements have a convenient interpretation; they arc the covariance matrices which result 
from partially conditioning on some of the components. This identification motivates the positive 
conditional covariance condition (Condition [3]). 
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In order to present the proof of the interpolation inequality as transparently as possible, we first 
gather together some relevant technical comments. To start with, suppose we have two sets of real 
numbers 

D = {U : i = 0, 1, . . . , n} C D = {I, : i = 0, 1, . . . , h) C [0, T] 

ordered in such a way that < to < t± < . . . < t n < T, and likewise for D. Suppose s and t be 
real numbers with s < t and let Z be a continuous Gaussian process. We need to consider how the 
variance of the increment Z s j changes when we condition on 

T D := a (Z ti _ u ti :i = l,...,n), 
compared to conditioning the larger er-algcbra 

T D := a (z~ U i h : i = l,...,n) . 

To simplify the notation a little we introduce 

Q =o (z h i;u : {ti-uU} HD\D + 0) , 

so that 

F 5 =F D V g. 

Because 

(2 s , t ,z (0)!l ,..,V l!(i )el i+1 (6.9) 

is Gaussian, the joint distribution of Z s , t and the vector (|6.9|) conditional on jF d (or indeed J- D ) 
is once again Gaussian, with a random mean but a deterministic covariancc matrix. A simple 
calculation together with the law of total variance gives that 

Var {Z s j\F D ) = E [Var (Z s , t \T D V Q)] + Var (E [Z S ^ D V Q\ ) 

> E [Var {Z Sit \T D V Q)} = Var (z a , t \F 6> ) , 

which is the comparison we sought. We condense these observations into the following lemma. 

Lemma 6.3. Let (Zt) t ^ 0T ^ be a Gaussian process, and suppose that D and D are two partitions 
of [0, T] with D C D. Then for any [s, t] C [0, T] we have 

Var(Z M |J- D ) >Var(z Sit \F 6 ^ . 

Our aim is to show how the optimisation problem of Lemma 16.21 can be used to exhibit lower 
bounds on 2D Young integrals with respect to R. In order to do this we need to take a detour via 
two technical lemmas. The first is the following continuity result for the conditional covariance, 
which we need approximate when passing to a limit from a discrete partition. The situation we will 
often have is two subintervals [s, t] C [0, S] of [0, T], and a sequence of sets (-Dn)^ < L 1 of the form 

D n = Dl U D 2 n . 

(Z?y^_ 1 and (D%)°°_-. here will be nested sequences of partitions of [0, s] and [t,S] respectively 
with mesh (D l n ) — > as n — > 00 for i = 1, 2. If 

T D :=a(Z u>v :{u,v}CD), 

then we can define a nitration (G n ) ! ^_ 1 by Q n := J- D " V J- D ™ and ask about the convergence of 

Cov (z Ptq z UtV \g n ) 
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as n — > oo for subintervals [p, q] and [u,v] are subintervals of [0,5]. The following lemma records 
the relevant continuity statement. 

Lemma 6.4. For any p,q,u,v such that [p,q] and [u,v] are subintervals of[0,S] C [0, T] we have 

Cov (Z Pt qZ UtV \G n ) -> Cov (Zp^Zu^lTo^s V T t ,s) , 

as n — > oo. 

Proof. The martingale convergence theorem gives 

Cov (Z PA Z u>v \Q n ) -t Cov (Z Pi9 Z UiV \ V£° =1 C? n ) , a.s. and in LP for all p > 1. 
The continuity of Z and the fact that mesh {D n ) — > easily implies that, modulo null sets, one has 

□ 

We now introduce another condition on Z, which we will later discard. This condition is virtually 
the same as Condition [3j the only difference being that we insist on the strict positivity of the 
conditional variance. 

Condition 6. Let (^t) tg r T] be a real-valued continuous Gaussian process. We will assume that 
for every [u,v] C [s,t] C [0, S] C [0,T] we have 

Cov (Z a , t , Z u ,„| 7b, a V 7t )S ) > 0. (6.10) 

The second technical lemma we need will apply whenever we work with a Gaussian process that 
satisfies Condition [6] It delivers a nested sequence of partitions, with mesh tending to zero, and 
such that the discretisation of Z along each partition will satisfy the dual feasibility condition (i.e. 
flSSj) in Lemma MM- 

Lemma 6.5. Let (Z t ) te ^ 0T ^ be a continuous Gaussian process that satisfies Condition^ Then for 
every 0<s<t<S<T there exists a nested sequence of partitions 

(A„)^ 1 = ({C:* = 0,l,...,n rn }C =1 

of [0, S] such that: 

(1) mesh{D m ) — > as m — > oo; 

(2) {s,t} C D m for all m; 

(3) If Z™ and Z™ are the jointly Gaussian vectors 

Z™ = (Zt T , t7ll :t?€D m n ([0, a) U [t, S))) , 

Z?= (z tT , tT+i :t?e D m n[M)), 

wzt/i respective covariance matrices Q"\ and Q™ 2 - Then the Gaussian vector (Z™,Z™) has 
a covariance matrix of the form 

nm _ f Qll Ql2 \ 

and the Schur complement of Q"\ in Q m has non-negative row sums. 
Proof. See the appendix. □ 
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The next result shows how we can bound from below the 2D Young integral of a Holder- 
continuous / against R. The lower bound thus obtained is expressed in terms of the minimum 
of /, and the conditional variance of the Gaussian process. 

Proposition 6.6. Suppose R : [0,T] 2 — > K is the covariance function of some continuous Gaussian 
process (Z t ) t ^ T y Suppose R has finite 2D p-variation for some p in [1,2), and that Z is non- 
degenerate and has a non-negative conditional covariance (i.e. satisfies Condition^). Let 7 G (0, 1) 
be such that l/p + 7 > 1 and assume f £ C 7 ([0,T],R). Then for every [s,t] C [0,T] we have the 
following lower bound on the 2D- Young integral of f against R : 




Remark 6.7. We emphasise again that !Fa,b is the a-algebra generated by the increments of the 
form Z u ^ v for u,v £ [a, b] . 

Proof. Fix [s,t] C [0,T], and take b := inf ueM \f (u)\. 

Step 1 : We first note that there is no loss of generality in assuming the stronger Condition [6] 
instead of Condition [3] To see this, let (B t ) t£ ^ Q T -, be a Brownian motion, which is independent of 
(Z t ) t£ ^ T i, and for every e > define the perturbed process 

Z\ := Z t + eB t . 

It is easy to check that Z e satisfies the conditions in the statement. Let T p be the c-algebra 

generated by the increments Z^ v between times p and q (note that actually equals F Pt q V 
f (Bim : u < I < m < q)), and note that we have 

Cov (Z e Sft , ZZ )V \J% >a V Fl T ) = Cov (Z,, u Z U , U \F , S V F t , T ) + e 2 (u - v) > 

for every 0<s<u<v<t<T. It follows that Z e satisfies Condition [5] Let R e denote the 
covariance function of Z e . If wc could prove the result with the additional hypothesis of Condition 
[6l then it would follow that 

/ fufvdtf (u,v)> b 2 Var (Z 5 e . t \J% V T^) 

J[0,T] 2 

= b 2 Var (Z.dTo,. V F t , T ) + b 2 e 2 (t-s). (6.11) 

Because 

/ fufvdR e (u, v)= fufvdR (u, v) + e 2 \f\ 2 L3 r T] , 

J[0,T] 2 J[0,T] 2 1 ' J 

the result for Z will then follow from (|6. 1 1|) by letting e tend to zero. 

Step 2: We now prove the result under the additional assumption of Condition [5J By considering 
—/if necessary we may assume that / is bounded from below by b on [s, t\. Since we now assume 
Condition [6] we can use Lemma 16.51 to obtain a nested sequence of partitions {D r )'^ =1 such that 
{s,t} C D r for all r, mcsh(Z?,.) — >• as r — > 00, and such that the dual feasibility condition 
(property 3 in the Lemma [63]) holds. Suppose D — {ti : i = 0, 1, . . . , n} is any partition of [0, T] 
in this sequence (i.e. D = D r for some r). Then for some / < m G {0, 1, . . . ,n — 1} we have £/ = s 
and t m = t . Denote by / (D) the column vector 

/(£) = (/ (to),..-, /(^i)) T eR'\ 
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and Q = {Qi,j) 1<i J<n the symmetric n x n matrix with entries 

Qij =R ( tjll'X ) = E [ z n-i,u z ti-i,tj] ■ 

From the non-degeneracy of Z it follows that Q is positive definite. The Ricmann sum approxima- 
tion to the 2D integral of / against R along the partition D can be written as 

n n / \ n n 

X» ft /.'( If )= y Z'Zft t -ift j - 1 Qi d =f(D) T Qf{D). (6.12) 

If necessary, we can ensure that that last m — I components of / (D) are bounded below by b. To 
see this, we simply permute its coordinates using any bijective map r : {1, . . . ,n} — > {1, . . . ,n} 
which has the property that 

t (I + j) = n — m + I + j, for j = 0, 1, . . . , m — I. 

Fix one such map r, and let f T (D) denote the vector resulting from applying t to the coordinates 
of / (D). Similarly, let Q T = {QJj) 1<i . <n be the n x n matrix 

QJj = Qt{x)t(J) , 

and note that Q T is the covariance matrix of the Gaussian vector 

z = \ z K(i)-i,Km> ■ ■ ■ ' ■^ t x(„)-i,t T(re) ) ■ 
A simple calculation shows that 

f(D) T Qf(D) = f T (D) T Q T f T (D). 

We can apply Lemma 16.21 because condition (|6.6[) is guaranteed to hold by the properties of the 
sequence (D r )^ =1 . We deduce that 

m — l 

f (D) T Qf (D) = f T (D) T Q T f T (D) > b 2 ( 6 - 13 ) 

i,j=l 

where S is the {m — I) x (to — I) matrix obtained by taking the Schur complement of the leading 
principal (n — m + I) X (n — m + I) minor of Q. As already mentioned, the distribution of a Gauss- 
ian vector conditional on some of its components remains Gaussian; the conditional covariance is 
described by a suitable Schur complement . In this case, this means we have that 

S = C3ov[(Z tl , tl+I ,...,Z tm _ 1 , tm ) \Z t ._ utj , 3e{l,...,l}U{m+l,...,»}] . (6.14) 

If we define 

T D := a (Z tj _ utj : j G {1, . . . , 1} U {m + 1, . . . , n}) , 
to be the c-algebra generated by the increments of Z in D \ [s,t], then using (|6.14p we arrive at 

m—l Tn—l—1 

s » = E [{ z ti+i-i,ti +i ) (Zn+j-uti+i) \f d ] 

m—l—1 

j2 s[(v 1 , t , +j )nM(v lA+i )n 

= E [{Z s . t f \T D ] - E [Z s , t \T D ] 2 = Var (Z s , t \F D ) . (6.15) 
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To finish the proof we note that T C T^.s V Ft,Ti and exploit the monotonicity of the conditional 
variance described by Lemma 16.31 to give 

Vax(Z Stt \T D ) > Var(Z M |J- , s V T t , T ) . (6.16) 

Then by combining (|6.16|) , f|6.15p and (|6. 13|) in (|6.12p wc obtain 

n n 

X Yl h-Jtj-iQij > >>- Var (Z.,t|Jb,. V F t , T ) . 

i=l j=l 

Because this inequality holds for any D € {D r )°?L 1 , we can apply it for I? = D r and let r — > oo to 
give: 

/ UfvdR (u, w) > 6 2 Var (Z a . t | Jb, a V ^.t) , 

J[0,T] 2 

whereupon the proof is complete. □ 

We now deliver on a promise we made in Section [3] by proving that the diagonal dominance of 
the increments implies the positivity of the conditional covariance. 

Corollary 6.8. Let (^t) t6 [ T] ^ e a r s-od-valued continuous Gaussian process. If Z satisfies Condi- 
tion^ then it also satisfies Condition^ 

Proof. Fix s < t in [0, T], let (D n )^L 1 be a sequence of partitions having the properties described 
in the statement of Lemma 16.41 and suppose [u,v] C [s,t]. From the conclusion of Lemma 16.41 we 
have that 

Cov (Z Stt Z UtV \g n ) -> Cov (Z Sit Z u , v \F 0iS V T t ,T) (6.17) 

as n — > oo. Let Z n be the Gaussian vector whose components consist of the increments of Z over 
all the consecutive points in the partition D n U {s, u, v, t\. Let Q denote the covariance matrix 
of Z n . The left hand side of (|6.17[) is the sum of all the entries in some row of a particular 
Schur complement of Q. Z is assumed to have diagonally dominant increments. Any such Schur 
complement of Q will therefore be diagonally dominant, since diagonal dominance is preserved 
under Schur-complemcntation (see [32|). As diagonally dominant matrices have non- negative row 
sums it follows that Cov (Z s jZ UiV \Q n ) is non-negative, and hence the limit in (|6.17|) is too. □ 

We are now in a position to generalise the L 2 -interpolation inequality (|6.5p stated earlier. 

Theorem 6.9 (interpolation). Let (Z t ) te ^ QT ^ be a continuous Gaussian process with covariance 

function R : [0, T} 2 — > M. Suppose R has finite two-dimensional p-variation for some p in [1,2). 
Assume that Z is non- degenerate in the sense of Definition HOI and has positive conditional co- 
variance (i.e. satisfies Condition^). Suppose f € C([0,T],K) with 7 + 1/p > 1. Then for every 
< S <T at least one of the following inequalities is always true: 

ll/IUio.s] <ZE[ZI}- 1/2 (j [0Si J'ft dR («^ > ( 6 - 18 ) 
or, for some interval [s, t] C [0, S] of length at least 

ll/lloo;[0,S] \ 
2H/ll 7 ;[0,sJ ' 
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we have 

lll/llL[os]Var(Z a . t |Jb, s V7- t , s ) < / f v f v ,dR(v,v') . (6.19) 

Proof. We take S = T, the generalisation to < 5 < T needing only minor changes. / is continuous 
and therefore achieves its maximum in [0,T]. Thus, by considering — / if necessary, we can find 
t G [0, T] such that 

/W = ll/lloc;[0 I T]- 

There are two possibilities which together are exhaustive. In the first case / never takes any value 
less than half its maximum, i.e. 

Hence we can can apply Proposition ^. 61 to deduce (j6.18[) . In the second case, there exists u G [0, T] 
such that / (u) = 2~ 1 H/H^o T y Then, assuming that u < t (the argument for u > t leads to the 
same outcome), we can define 

S = SU P | U < 1 1 / 0) ^ \ H/!loo;[0,T] 

By definition / is then bounded below by H/Hoo-m 7*1 /2 on [s,t]. The Holder continuity of / gives 
a lower bound on the length of this interval in an elementary way 

5ll/lloo ;[ o,r] = l/(*)-/(«)l<ll/IW] i*-*r. 

which yields 

, , ( ll/lloo;[0,T] V 7 

Another application of Proposition 16.61 then gives (|6.19|) . □ 

Corollary 6.10. Assume Condition^ so that the p-variation of R is Holder-controlled, and for 
some c > and some a G (0, 1) we have the lower bound on the conditional variance: 

Var(Z s , t |7b, s V Ji,r) >c(t-s) a . 
Theorem \6.9\ then allows us to bound \\fWoQ.\QT\ a ^ ove by the maximum of 

ft 

2E [Z 2 T Y V2 ^J^J s f t dR(s,t)J 



and 



2 

7c 



/ r \ 7/(27+") 

\L ? isftdRM ) m ^ 



Proof. This is immediate from Theorem 16.91 □ 

In particular, if Z is a Brownian motion we have Var (Z s ^\J-o. s V J-t.r) = (t — s), hence Corollary 
16.101 shows that 

ll/IL,^ < 2 max (T-W \f\ W] , l/ftjgf 1 ' M^gR*) , 
which is exactly (|6.5[) . We have therefore achieved out goal of generalising this inequality. 
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7. MALLIAVIN DIFFERENTIABILITY OF THE FLOW 

7.1. High order directional derivatives. Let x be in WGQ p (]R d ) and suppose that the vector 
fields V = (Vi, . . . , Vd) and Vo are smooth and bounded. For t £ [0, T] we let C/ t ^_ (•) denote the 
map defined by 

&t5-o (') : Vo i-> y t , 

where y is the solution to the RDE 

dy t = V(yt)dx t + Vo{yt)dt, y(0) = y O - (7.1) 

It is well-known (see [12]) that the flow (i.e. the map yo i— > U^ (yo)) is differentiable; its derivative 
(or Jacobian) is the linear map 



J t ^ ( yo )(.)= A[/x_ o(yo + e . ) 



€ L (R e ,K 6 ) 



=o 



If we let ^f^o (yo) denote the pair 

(vo) = (c^_o (^) , J^ (yo)) (K e , M e ) , 

and if W = (Wi, . . . , Wd) is the collection vector fields given by 

W t (y, J) = {Vi (y) ,VVi(y)-J), i = l,...,d, 

and 

W (y, J) = (V (y) , VVq (y) ■ J) 

then $£_o (2/0) is the solution^ to the RDE 

d$£_ = W ($?<_„) ^ + W (3£_ ) (ft, $^ |t=o = (yo, I) ■ 

In fact, the Jacobian is invertible as a linear map and the inverse, which we will denote (yo), is 
also a solution to an RDE (again jointly with the base flow U£_ (yo))- We also recall the relation 



J^ (y) ■ Jo^s (y) ■ 



ae €=0 
Notation 3. In what follows we will let 

acu (yo) = (u^ (yo) , JS_o (2/0) , JoVt (yo)) e R e © R exe e M exe . (7.2) 

For any path h in (7<?-var ([0, T] , R d ) with 1/q+l/p > 1 we can canonically define the translated 
rough path T^x (see [H]). Hence, we have the directional derivative 

D h u^M= ^c/ t T A x (yo) ■ 

fl£ e=0 

It is not difficult to show that 

D h u^o (yo) = E /* J ^ (»°) ^ (^o (yo)) dhl 
i=l - 70 

which implies by Young's inequality that 

|iW<-o (yo)l < C llA/.^o (yo)|| p . vai , [0 , t] \h\ q . Yax . m • (7.3) 



■*"A little care is needed because the vector fields have linear growth (and hence are not Lip-7. But one can exploit 
the 'triangular' dependence structure in the vector fields to rule out the possibility of explosion. See |12j for details. 
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In this section we will be interested in the form of the higher order directional derivatives 

D hl ... D hn U^ (y ) := - ^W-T^x ( 

oei,...de n ei =...= e „=o 

Our aim will be to obtain bounds of the form (|7.3[) . To do this in a systematic way is a challenging 
exercise. We rely on the treatment presented in [21) . For the reader's convenience when comparing 
the two accounts, we note that [3T] uses the notation 

(D s u^ (yo)) se[0 ,T] = PtUu-o (»o) . • ■ - .-Dfc^-o (yo)) sG[0jX] G R d 

to identify the derivative. The relationship between D S U£_ (yo) and DhU£_ (yo) is simply that 



DhU^ Q (y Q ) = £ / D*E/?<_ (»>) dA< 



Note, in particular D S U£_ (yo) = if t < s. 

Proposition 7.1. Assume x is m lyGfip (R d ) and Zet V = (Vi, . . . , Vd) be a collection of smooth 
and bounded vector fields. Denote the solution flow to the RDE |7. l\ l by 

C/£_ ( yo ) = (C/£_ (yo), C^_ (y„) e ) G K e 

Suppose q > 1 and n£fj and H{/n,...,/i„} fee any sufesef; o/ C q ~ var ([0, T] , R d ) . TTierc tte direc- 
tional derivative Df ll ■ ■ ■ Dh„U£_ (yo) exists for any t £ [0,T]. Moreover, there exists a collection 
of finite indexing sets 

{ K (»i,...,t n ) : (n, ■ • ■ , *n) G {1, ■ • -,d} n } , 
such that for every j G {1, .., e} we /iave i/ie identity 

d 

D hl ...D hn U^ (y )j= ]T E / f k 1 (t 1 )...f k n (t n )f k n+1 (t)dh\\...dh\i, 

h,...,i n =l fc € K (il , /0<*i<-<*"<* 

(7.4) 

/or some functions f k which are in C p var ([0, T] , K) /or every I and k, i.e. 

U(i 1> ..., iw ) e{ i,... )( j } » U feeK(ii in) {/,* : Z - 1, .., n + 1} C C*™ ([0, T] , R) . 

Furthermore, there exists a constant C , which depends only on n and T such that 

\ft U ari[0 ,T] < C (l + H A/X -o (2/o)II ot[0 ,t]) P (7-5) 
for every I = 1, . . . ,n + 1, every k G j n j and every (i\, . . . , i„) € {1, . . . , d}™. 

Proof. We observe that -Df^ . . . Dh„U£_ (yo)j equals 

E / ^:±^o(2/o) i ^---^:- ( 7 - 6 ) 



ii ,. . .,i n — 1 



0<ti<. ..<*„<* 



The representation for the integrand in ()7.6)) derived in Proposition 4.4 in |21j then allows us to 
deduce (|7T4l and ([73]) . □ 
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7.1.1. Malliavin differentiability. We now switch back to the context of a continuous Gaussian 
process (X t ) t<£ ^ T j = (X£, . . . ,Xf) t ^ Q T , with i.i.d. components associated to the abstract Wiener 
space (W, %, /i). Under the assumption of finite 2d p- variation we have already remarked that, for 
any p > 2p, X has a unique natural lift to a geometric p-rough path X. But the assumption of 
finite p-variation on the covariance also gives rise to the embedding 

U^C 9 -™ ([0,T],R d ) (7.7) 

for the Cameron-Martin space, for any 1/p + 1/q > 1 [7j Prop. 2]. The significance of this result 
it twofold. First, it is proved in [7J Prop. 3] that it implies the existence of a (measurable) subset 
VcW with (i (V) = 1 on which 

T h X (co) = X (w + h) , 
for all h e H simultaneously. It follows that the Malliavin derivative T>U t 



' x( o } M 



n 



VV*$ (yo) ■ V h U, 



X( W ) 



(yo) := ±U*«**> (y ) 



(7.8) 



coincides with the directional derivative of the previous section, i.e. 



X.(io+eh) 



(yo) 



dk u * 



T 5h x 



The second important consequence results from combining (|7.7I 



vufSo ] (yo) 



< c 



op 



M 



X(a 



(yo) 



(7.9) 

;=0 

7.9P and (|7.3j) . namely that 

(7.10) 



p-var;[0,t] 



If we can show that the right hand side of (|7.10[) has finite positive moments of all order, then 
these observations lead to the conclusion that 

Yt = U^u_ (yo) € n p >ilD> 1 ' p (M e ), 

where D fe ' p is the Shigekawa-Sobolev space (see Nualart). The purpose of Proposition 17.11 is to 
extend this argument to the higher order derivatives. We will make this more precisely shortly, but 
first we remark that the outline just given is what motivates the assumption 

n ^C q - vm ' ([0,T},R d ) 

detailed in Condition rffl. 

The following theorem follows from the recent paper [5]. It asserts the sufficiency of Condition 
[1] to show the existence of finite moments for the p- variation of the Jacobian of the flow (and its 
inverse) . 

Theorem 7.2 (Cass-Litterer-Lyons (CLL)). Let (Xt) te i 0T i be a continuous, centred Gaussian 
process in M d with i.i.d. components. Let X satisfy Condition{J\ so that for some p > 1, X admits 
a natural lift to a geometric p-rough path X . Assume V = (Vo, V±, . . . , Vd) is any collection of 
smooth bounded vector fields on M e and let U^_ (■) denote the solution flow to the RDE 

dU*_ (yo) = V (E/* o (y )) dX t + V {U*_ (y a )) dt, 

U£_ (y ) = yo- 



2 The requirement of complementary regularity in the Condition [T] then amounts to p £ [1, 3/2). This covers BM, 
the OU process and the Brownian bridge (all with p = 1) and fBm for H > 1/3 (taking p = 1/2H). For the special 
case of fBm one can actually improve on this general embedding statement via Remark 12.61 . The requirement of 
complementary then leads to the looser restriction H > 1/4. 
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Then the map U£_ (•) is differentiable with derivative J^_ (yo) G M exe ; J^_ (yo) is invertible as 
a linear map with inverse denoted by J^_ t (yo)- Furthermore, if we define 

M* (y ) ee (U£_ (y ) , J t * (vo) , J$- t (yo)) e^ffl R exe 8 M exe , 
and assume X satisfies Condition^ we have that 

ll^-o(yo)||„ [0 . T] e fl L9 ^)- 

Proof. This follows from by repeating the steps of [5] generalized to incorporate a drift term. □ 

Remark 7.3. Under the additional assumption that the covariance R has finite Holder- controlled 
p-variation, it is possible to prove a version of this theorem showing that: 

H^*oG/o)|| 1/p e f)V(p). 

q>l 

7.2. Proof that U£l Q ' (y ) G D°° (R e ). We have already seen that appropriate assumptions on the 
covariance lead to the observation that for all h G Ti, 

D h U t ^ >(y ) = T U t ^o '(yo) 

for all u in a set of p-full measure. We will show that the Wiener functional u i-> (yo) belongs 

to the Sobolev space D°° (R e ). Recall that 

E>°° (R e ) := n p> i D^ =1 B k ' p (R e ) , 

where D fc ' p is the usual Shigekawa-Sobolev space, which is defined as the completion of the smooth 
random variables with respect to a Sobolev- type norm (see Nualart [30]). There is an equivalent 
characterisation of the spaces B fc ' p (originally due to Kusuoka and Stroock), which is easier to use 
in the present context. We briefly recall the main features of this characterisation starting with the 
following definitions. Suppose E is a given Banach space and F : W —}E is a measurable function. 
Recall (see Sugita [21]) that F is called ray absolutely continuous (RAC) if for every h G H, there 
exists a measurable map Fh ■ W — >E satisfying: 

F (•) = F h (•) , n - a.e., 

and for every oj G W 

i 1 ^ _F)j ((j + t/i) is absolutely continuous in t G M. 

And furthermore, F is called stochastically Gateaux differentiable (SGD) if there exists a measurable 
G : W — s-Z/ ("H, £), such that for any he H 

- [F (■ + th) - F (•)] A G (w) (h) ast^O, 

where A indicates convergence in ^-measure. 

If F is SGD, then its derivative G is unique p-a.s. and we denote it by VF. Higher order 
derivatives are defined inductively in the obvious way. Hence V n F (oS) (if it exists) is a multilinear 
map (in n variables) from % to E. 

We now define the spaces D fe ' p (R e ) for 1 < p < oo by 

D 1,p (R e ) := {F G L p (R e ) : F is RAC and SGD, VF G L p (Z (H, R e ))} , 
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and for k = 2, 3, 

§fc,P ( R e) := j> g fs>k-hP • p F g D^-l.P (i(-H,M e ))| . 

Theorem 7.4 (Sugita [31]). For 1 < p < oo and k e N we have B fe 'P (JR e ) = O fe 'P (R e ). 
It follows immediately from this result that we have 

(R e ) = n p> i n£L x B k > p (R e ) . 

With these preliminaries out the way, we can prove the following. 

Proposition 7.5. Suppose (Xt) te ^ T j is an R d -valued, zero-mean Gaussian process with i.i.d com- 
ponents associated with the abstract Wiener space (W,%, /i). Assume that for some p > 1, X lifts 
to a geometric p-rough path X. Let V = (Vo, Vi, . . . , Vd) be a collection of C°° -bounded vector fields 
on Wt e , and let U^_q (yo) denote the solution flow of the RDE 

dY t = V (Y t ) dX t (w) + V (Y t ) dt, Y (0) = y . 

Then, under the assumption that X satisfies Condition]]^ we have that the Wiener functional 

is in ©°° (K e ) for every t e [0, T}. 

Proof. We have already remarked that Condition [1] implies that on a set of /x-full measure 

ThX (lj) = X (lj + h) (7.11) 

for all h G 71. It easily follows that C/ t< _: (yo) is RAC. Furthermore, its stochastic Gateaux derivative 
is precisely the map T>U^_q (yo) defined in (|7.8[) . The relation (|7.1ip implies that the directional 
and Malliavin derivatives coincide (on a set of /i-full measure ) hence VUf^ ] (y ) € L(H, R e ) is the 
map 

VU*$ (y ):h*D h u£tf (y ) . 

We have shown in (|7. 10[) that 

{y0) Wo P - C H M --° (2/0) l-var;[0,T] > ( 7 " 12 ) 

where 

M* (y ) = (£/£_„ (y ) , J t * (2/0) , (2/0)) • (7.13) 
It follows from Theorem 17.21 that 

ll^oM|^ r . [0iT] en^(p). 

Using this together with (|7T2|) proves that (y ) is in n p> iB 1 ' p (R e ) which equals n p> iD 1 ^ (IT) 

by Theorem 17.41 

We prove that (y ) is in n p> iO fc - p (K e ) for all k e N by induction. If (y ) £ 

pfc-i,p (R e ) then, by the uniqueness of the stochastic Gateaux derivative, we must have 

V^uf^ (y ) fa,.. .,/i fe -i) = D hl . ..D hh U%$> (y ) . 

It is then easy to see that T> k ~ 1 U^2Q (yo) is RAC and SGD. Moreover, the stochastic Gateaux 
derivative is 

V k U%$ (y ) :(Zn,..., h k ) = D hl ...D hk (yo) . 
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It follows from Proposition 17.11 together with Condition [T] that we can bound the operator norm of 

(fe+i)p 



V k ufSt ] (Vo) in the following way: 



ll^^o ) (yo)Ho P <c(i+ M.t7(yo) 



p-var;[0,T] 



for some non-random constants C > 0. The conclusion that U^Zq (yo) £ n p >iD fc,p (K e ) follows at 
once from Theorems 17.21 and 17.41 □ 

8. Smoothness of the density: the proof of the main theorem 



This section is devoted to the proof of our Hormander type theorem 13.51 As mentioned in 
the introduction, apart from rather standard considerations concerning probabilistic proofs of 
Hormandcr's theorem (see e.g. [H]), this boils down to the following steps: 

(1) Let W be a smooth and bounded vector field in R e . Following [2T], denote by (^ t w/ ) ig j T j 
the process 

Zf= J^ t W(U* (y )). (8.1) 

Then assuming Conditions [2] and [3] we get a bound on \Z w \oo in terms of the Malliavin 
matrix Ct defined at (|6.1j) . This will be the content of Proposition [831 

(2) We invoke iteratively our Norris lemma (Theorem 15. 6|) to processes like Z w in order to 
generate enough upper bounds on Lie brackets of our driving vector fields at the origin. 

In order to perform this second step, we first have to verify the assumptions of Theorem 15. 61 for 
the process M^_ (y ) defined by (|7. 13[) . Namely, we shall see that M^_ (y ) is a process controlled 
by X in the sense of Definition 15.11 and relation (|5.6p . 

Proposition 8.1. Suppose (X t ) t ^ Q r i satisfies the condition of Theorem \7.2\ In particular, X has 

a lift to X, a geometric-p rough path for some p > 1 which is in C°'T([0,T];GLpJ (R d )) forj=l/p. 
Then M^_ (jjq) is a process controlled by X in the sense of Definition 1 5. 1\ and 

P>1 

Proof. For notational sake, the process M^_ (y ) will be denoted by M only. It is readily checked 
that M is solution to a rough differential equation driven by X, associated to the vector fields given 

by 

F i (y,J,K) = (V i (y),VV i (y)-J,-K-VV i (y)), i = 0,...,d. (8.2) 
This equation can be solved either by genuine rough paths methods or within the landmark of 
algebraic integration. As mentioned in Proposition [53J both notions of solution coincide thanks to 
approximation procedures. This finishes the proof of our claim M € Q^- 

In order to prove integrability of M as an element of Q^c, let us write the equation governing 
the dynamics of M under the form 

d 

dM t = F (M t ) dt + Y, F l (M t ) dXl, 

i=l 

where X is our Gaussian rough path of order at most N = 3. The expansion of M as a controlled 
process is simply given by the Euler scheme introduced in [121 Proposition 10.3]. More specifically, 
M admits a decomposition (|5.2p of the form: 

Mi jt = Mi<°(t -s) + Al^ X# + X;;r + Rtf, 
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with 

Mi>° = FZ(M a ), M>* = Fi(M t ), Mi>^ = F i2 Fl(M a ), \W a f \ < c M \t - . 



With the particular form (|8.2[) of the coefficient F and our assumptions on the vector fields V, it is 
thus readily checked that 

p/|| e , < c v (f + 1| JUL + || j- 1 ^ + 1| J|| 7 + \\u\\ 7 ) , 

and the right hand side of the latter relation admits moments of all order thanks to Theorem 17.21 
and the remark which follows it. □ 

Define £ x (jjq, 9, T) to be the quantity 

£ x (y , 6, T) := 1 + L e (x)' 1 + \y \ + p/. x ^ (j/o)|| Q3 + M x , y 

Corollary 8.2. Under the assumptions of Provosition we have 

C x (y ,9,T)e f]LP(n). 
P>1 

Proof. We recall that the standing assumptions imply that ||X|| 7 .r T i has a Gaussian tail (see (|2.8j) 
from Section [2]). It is easily deduce from this that 



A/x, 7 G f) LP (fi) . 
p>i 

Similarly we see from Corollary 15.81 and Proposition 18.11 that Lg (x)^ 1 and ||M;^_ (yo)||g^ have 
moments of all orders and the claim follows. □ 

Definition 8.3. We define the sets of vector fields Vk for k 6 N inductively by 

Vi = {Vi-.i = l,...,d}, 

and then 

K+i = {[V h W] : i = 0, 1, . . . , d, W G V n } . 

Proposition 8.4. Let (^"t)te[oT] = i-^-t ^ ■ ■ ^t) t e[o t] ^ e a con ti nuous Gaussian process, with 
i.i.d. components associated to the abstract Wiener space (W, %,/!). Assume that X satisfies the 
assumptions of Theorem Iff. 51 Then there exist real numbers p and 9 satisfying 2/p > 9 > a/2 
such that: (i) X is 9 -Holder rough and (ii) X has a natural lift to a geometric p rough path X in 
C°< 1 /P([0,T};G^{R d )). Forte(0,T] let 

d „ 

°t = E / J ** * <&>) V i <X.) <8> J^ s , (yo) Vi (Y s , ) dR (s, s') , 

and suppose k G N U {0}. Then there exist constants /i = \i (fc) > and C = C (t, k) > such that 
for all W G and all »£l e with \v\ = 1, we have 

\{^ zW )L- M <CC*{yo,e,tY (v T C t v)r (8.3) 



SMOOTHNESS OF GAUSSIAN RDES 



37 



Proof. Let us prove the first assertion. To do this, we note that the constraint on p implies that X 
lifts to a geometric p-rough path for any p > 2p. Because the p-variation is assumed to be Holder- 
controlled, it follows that X is in C°' 1/p ([0, T]; GLpJ (R d )). By assumption a < 2/p, therefore we 
may always choose p close enough to 2p in order that 

2 a 

On the other hand X is 6- Holder rough for any 8 > a/2 by Corollarv l5.8l Hence there always exist 
p and 6 with the stated properties. 
We have that 



d . 

v T C t v = V Al with A* = / f (s) /' (a') di? (s, «') 
£f -AM 2 



(8.4) 



where we have set p (s) :— (y, j£_ s (yo) Vi (y s )) = \v, Z^)- Furthermore, because the hypotheses 
of Thcorem l6.9l and Corollary 16. 101 arc satisfied, we can deduce that 



I/' 



oo;[0,{] 



< 2 max 



A 



i 1 1/2 



1 



a; 



,7/(27+a) 



ia/(27+a) 



_E[X?f 2 '' Vc 

for i = 1, . . . , d. On the other hand Young's inequality for 2D integrals (see [14]) gives 

2 

_l I t l CnM 

l7;[o,t] 



+ |r(0)|]V p (i?; [0,t] 2 ). 



(8.5) 



(8.6) 



From (|8.6|) . (|8.5[) and the relation v T Ctv = ^j=i ^ ^ follows that there exists some Ci > 0, 
depending on t and c, such that we have 



1/ L.m.i < Ci [v C t v) .mfflt^ 



loo;[0,t] 

Using the fact that for some v > 
If (0)1 + 1/ 



h;[o,t] 



h;[o,t] 



r (o)| + |/ 

< C 2 £x (yo, 0, t) u for i = 1, .., d 



a/(2 7 +a) 



it is easy to deduce that (|8.3[) holds whenever W € Vi. 

The proof of (|8.3[) for arbitrary fc G N now follows by induction. The key relation comes from 
observing that 

(v,Z?) = (v,W(y )) + J2 f U (v,Z^W)dXU 

i=l •'° 

in the sense of Proposition [53J Hence, assuming the induction hypothesis, we can use Theorem 
to obtain a bound of the form (18.31) on 



v, Z\ 



[Vi,W] 



;[o,t] 



for all W eV k - Since V fc+ i = {[V i} W] : i = 0, 1, . . . , d, W G V„}, the result is then established. □ 

We are now in a position to prove our main theorem. Since the structure of the argument is the 
classical one, we will minimise the amount of detail where possible. 
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Proof of Theorem \3.5l This involves assembling together the pieces we have developed in the paper. 
First let 2/p > 8 > a/2 be chosen such that X is ^-Holder rough and X has a natural lift to a 
geometric p rough path X in C 0,1 / p ([0, 1]; G^J (R d )). This is always possible by the first part of 
Proposition 18.41 Let < t < T and note that we have shown in Proposition 17.51 that (yo) is 

in D°° (R e ). The result will therefore follow by showing that for every q > 0, there exists c\ = C\ (q) 
such that 



P ^in^ (v,C t v) < ej < ae q , 

for all e <G (0, 1). It is classical that proving (detCV)" 1 has finite moments of all order is sufficient 
for U^q* 1 (y ) to have a smooth density (see, e.g., [30]). 

Step 1: From Hormander's condition there exists N e N with the property that 

span {W (y ) : W e U^V;} = R e . 
Consequently, we can deduce that 



a:= inf £ \(v,W (y )) \ > 0. (8.7) 

\v\ — 1 

For every W £ U^V.; we have 



u|=l 



\(^Z w )\ oc][ot] >\(v,W(y ))\, (8.8) 

and hence using (|8.7|) . (|8.8p and Proposition 18.41 we end up with 

a< inf sup \{v,Z w )\ < Cl £x(y ,e,tf M \v T C t v\\ (8.9) 

|f |=i w^ufL ± Vi ' M =1 

for some positive constants ci, fx = /zjv and 7r = ttn- 
Step 2: From (|8.9p can deduce that for any e G (0,1) 

p f urf |w T a«| < < p (r x (i/O,*,*)" >c 2 e~ fc ) 

for some constants c 2 > and fc > which do not depend on e. It follows from Corollary 18 . 2 1 that 
for every q > we have 

P ( inf \v T C t v\ < e ) < c 3 e fc,? , 
VM=i' 1 / 

where c 3 = c 3 (g) > does not depend one. □ 

9. Appendix 

Proo f of Lemma \6.5\ We prove the result for S — T, the modifications for S < T arc straight 
forward. Consider three nested sequences (A m )™ =1 , (5 m )™ =1 and (C m )m=i consisting of partitions 
of [0, s], [s, £] and [i, T] respectively, and suppose that the mesh of each sequence tends to zero as 
m tends to infinity. For each mi and rri2 in N let D mi ' m2 denote the partition of [0, T] defined by 

D m ^=A mi UB m2 UC mi . 

We now construct an increasing function r : N — > N such that 

together form a nested sequence of partitions of [0, T] having the needed properties. 
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We do this inductively. First let m = 1, then for every two consecutive points u < v in the 
partition B m Lemma 16.41 implies that 

Cov (Z s>t Z UiV \ T A " V .F c ") -> Cov (Z^t^l -F s ,t V F t:T ) ■ 

as n — > oo. Z has positive conditional covariance, therefore the right hand side of the last expression 
is positive. This means we can choose r (1) to ensure that 

Cov (Z 3tt Z UtV \ V J rCr<1) ) > 0, (9.1) 

for every two consecutive points u and v in B m (the total number of such pairs does not depend 
on r(l)). We then let D\ = D^ 1 '' 1 , both properties 2 and 3 in the statement are easy to check; the 
latter follows from (|9.1[) . when we interpret the Schur complement as the covariance matrix of Z\ 
conditional on Z\ (see also the proof of Proposition [676]) . Having specified r (1) < . . . < r (k — 1) we 
need only repeat the treatment outlined above by choosing some natural number r (fc) > r (k — 1) 
to ensure that 

Cov (Z s . t Z u ^\F A ^ V F c ^) > 0, 

for each pair of consecutive points u < v in B^. It is easy to verify that (-D m )™ = i constructed in 
this way has the properties we need. □ 
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